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Abstract 

Standard linguistic analysis of syntax uses the T-model. This 
model requires the ordering: D-structure > S-structure > LF, where 
D-structure is the sentences deep structure, S-structure is its surface 
structure, and LF is its logical form. Between each of these representa- 
tions there is movement which alters the order of the constituent words; 
movement is achieved using the principles and parameters of syntactic 
theory. Psychological analysis of sentence production is usually either 
serial or conncctionist. Psychological serial models do not accommo- 
date the T-model immediately so that here a new model called the P- 
model is introduced. The P-model is different from previous linguistic 
and psychological models. Here it is argued that the LF representation 
should be replaced by a variant of Frege's three qualities (sense, refer- 
ence, and force), called the Frege representation or F-representation. 



In the F-representation the order of elements is not necessarily the 
same as that in LF and it is suggested that the correct ordering is: 
F-representation > D-structure > S-structure. This ordering appears 
to lead to a more natural view of sentence production and processing. 
Within this framework movement originates as the outcome of empha- 
sis applied to the sentence. The requirement that the F-representation 
precedes the D-structure needs a picture of the particular principles 
and parameters which pertain to movement of words between repre- 
sentations. In general this would imply that there is a preferred or 
optimal ordering of the symbolic string in the F-representation. The 
standard ordering is retained because the general way of producing 
such an optimal ordering is unclear. In this case it is possible to pro- 
duce an analysis of movement between LF and D-structure similar to 
the usual analysis of movement between S-structure and LF. The ne- 
cessity of analyzing corrupted data suggests that a maximal amount 
of information about a language's grammar and lexicon is stored. 
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1 Introduction. 

1.1 Forward. 

For the purposes of theory language can be split up into segments: para- 
graphs, sentences and so forth. It is not always clear what the segments are 
in natural language, especially spontaneous speech, see for example Rischel 
(1992) p2| . Linguists and psycholinguists give priority to the analysis of 
sentences and approach this in different ways, for example, there are a vari- 
ety of ways of approaching sentence word order. Linguists usually approach 
word order by invoking the T-model in which various principles are used 
to change word order from a primitive form (D-structure) to the audible or 
written form (S-structure); linguists do not seem to realize that the T-model 
is a psycholinguistic processing model. Other ways of accounting for word 
order include the Markov cascade models of Brants (1999) ||. 

Linguistics is split up into several subdisciplines which include: phonol- 
ogy, morphology, social, historical, semantics, and syntax. According to 
James McCloskey (1988) @: 

the study of syntax has always been a more acrimonious busi- 
ness than the pursuit of [its] sister disciplines in linguistics. 

Modern syntax grew from the need to record the grammars of North Amer- 
ican Indians whose languages where rapidly becoming extinct, much of this 
work was done by Bloomfield in the 1920's. In the 1950's Chomsky started 
applying an analogy with pure mathematical category theory to syntax - 
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then syntax began to take its modern form. The way that syntactical in- 
vestigations usually take place is by analyzing contrived sentences, rather 
than naturally occurring sentences. Similarly psychology is split up into 
several subdisciplines which include: social, developmental, psychometric, 
neuropsychology, and cognitive. 

Linguistics and psychology are not as sociologically closely related as one 
might expect. For example, psycholinguists rarely produce phrase trees of 
their test sentences: this would be the starting point of any syntactic anal- 
ysis; an exception to ignoring the work of linguists being Hall (1995) I^Of] 
who discusses potential contributions of psycholinguistic techniques to Uni- 
versal Grammar. Similarly syntactians rarely refer to the measurements of 
psycholinguists. Some well known linguistic textbooks on syntax, for exam- 
ple Chomsky (1986) ||, invoke and usually start with a processing model 
called the T-model. Here I point out that this is indeed a psycholinguis- 
tic model, and can be subject to the methodology of that discipline. This 
model requires the ordering: D(eep) — Structure > S(ur face)— Structure > 
LF (logical form). Between each of these representations there is movement, 
described by various principles, which alters the order between constituent 
words. That grammar should relfect more closely the workings of the human 
parser has been sugg ested by Phillips (2001) Q and Richards (1999) @. 

Frege analyzed the meaning of a sentence to depend on three qualities: 
sense, reference, and force. In §2.2 it is argued that Frege's three qualities 
which describe a sentence: sense, reference, and force, should be replaced 
by the five qualities: external referents, lexical referents, formal declarants, 
formal string, and force. These qualities form a representation here called 
the F(rege)-representation. In the F-representation the order of elements is 
not necessarily the same as in the LF. It is then suggested that the correct 
ordering is: F — representation > D — structure > S — structure. This 
ordering leads to a more natural view of sentence production (or process- 
ing) called the P-model. Within this framework movement originates as the 
outcome of the emphasis applied to the sentence; rather than as it occurs in 
linguistic models where movement is unmotivated and ad hoc. The require- 
ment that the F-representation precedes the D-structure needs a picture of 
the principles and parameters which pertain to movement of words between 
representations. In general this would imply that there is a preferred or 
optimal ordering of the symbol string in the F-representation. The general 
way of producing such an optimal ordering is unclear; but might be found 
by invoking an extremal principle as discussed in §3.1. In §6 a new model 
called the P-model is presented, this model uses the standard ordering, as 
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the optimal ordering is still unknown. For the P-model it is possible to pro- 
duce an analysis of movement between LF and D-structure similar to the 
usual analysis of the movement between S-structure and LF. 

In §3.3 it is suggested that a maximal amount of information about a 
language's grammar and lexicon (vocabulary) are stored. At first sight this 
might seem inefficient, but could occur because it allows for a quick analysis 
of speech which is often only partial heard. 

1.2 Language Acquisition. 

McDonald (1997) |35] reviews how language learners master the formal 
structure of their language. She investigates Three possible routes to the 
acquisition of linguistic structure: firstly the use of prosodic and phonologi- 
cal information, secondly the use of function words to syntactically classify 
co-occurring words and phrases, and thirdly the use of morphology internal 
to the lexical items to determine language structure, and the productive re- 
combination of these subunits in new items. Evidence supporting these three 
routes comes from normal language acquirer's and from several special pop- 
ulations, including learners given improvised output, learners with Downs 
syndrome, and late learners of first and second languages. Further evidence 
for the three routes comes from artificial language acquisition experiments 
and computer simulations, see also Ferro et al [|ll|]. Language acquisition 
has also been reviewed by Gleitman and Bloom (1998) p!q| . There is also 
the problem of how the segments of language, as discussed in §5.1, occur 
in language acquisition. Josephson and Blair (1998) |^6[ view language ac- 
quisition primarily as an attempt to create processes that connect together 
fruitfully linguistic input and other activity. From a philosophers point of 
view this is a coherence theory of language, see the discussion in Roberts 
(1998) ^3|§6.1. Hymans (1985) |22|| d iscusses how parameters are set in 
language acquisition. Valian (1990) [|52[ argues that in a child's acquisition 
of whether to have a null subject or not, there is an initial dual switch allows 
both options; the previously accepted model being either null subjects or not 
at one value being predetermined at one choice. There is also the question 
of how language evolved in the first place, the main differences being over 
whether most of it evolved recently or whether it evolved gradually, see the 
review of MacWinney (1998) @. 
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Figure 1: Diagram One - The T- model. 
1.3 The T-Model. 

The T-model is the basic framework within which linguistic syntax is cur- 
rently understood. There are various levels with different word order, the 
word order being altered by movement. The object of linguistic syntax is to 
find the rules which describe movement. The T-model is usually pictured 
by diagram one of an upside down Y, (not an upside down T from which 
its name derives) see for example: Chomsky (1986) || p. 68, Cook (1988) 
§ p.31, Haegeman (1994) ||] p.493, Hornstein (1995) f2| p.2. Brody 
(1998) 0] presents a system of principles relating the LF representation to 
lexical items that are compatible to his assumption of no externally forced 
imperfections in syntax; his assumption is a generalization of the linguistic 
projection principle. 

The T-model illustrated in diagram one consists of several levels. The 
D-structure (deep structure) level is supposed to hold a given sentence in a 
primitive form. The D-structure level cannot be the same as the message 
level in serial psychological models as there individual words are supposed 
to be already delineated. In the Garret model (1980) JT^ ] (see also Garman 
(1990) |y| p. 394) the D-structure level could be interpreted as being about 
midway in the sentence level. This level is then subject to various rules 
pertaining to how the order of the words can be moved. These rules make up 
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Diagram Two 
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Figure 2: Diagram Two - Quantifier Raising 



the bulk of the principles and parameters approach to linguistic syntactical 
analysis. The communicative purpose of such movement is to alter the 
emphasis of D-structure. The consequence of movement on the D-structure 
level is to produce another level called the S-structure (surface structure) 
level. The S-structure sentences are physically realized by the PF (phonetic 
form) and this corresponds to the positional level representation in serial 
models. To produce a post-hoc analysis of S-structure for its formal content 
it is postulated, by analogy with movement from S to D-structure, that 
movement again occurs to bring the sentence into its LF (logical form) . The 
"T" diagram is sometimes made more complex by the addition of other 
factors, e.g. Cook(1988) |?J p. 33. Here movement from S-structure to LF 
is illustrated with examples of quantifier-raising and Wh-raising taken from 



Haegeman (1994) g9| Ch.9. 



1.4 Quantifier- Raising. 

Consider the surface structure sentence (compare Haegeman (1994) [jOJ 
p.489 eq.3): 

Jones saw everyone. (1) 

To this surface structure word order movement is applied to yield the LF 
word order given by the string This can be represented by diagram two 
which can be expressed in terms of symbols 
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Vx, (xeH — > JeSx), 



(2) 



where H,J and S denote "human", "Jones", and "saw" respectively, and V 
denotes 'for all'. Using traces this can be expressed in terms of words 

For all x it is the case that if x is human then Jones saw x. (3) 

or in terms of the quantifier everyonei 

\Everyonei\J ones saw x,j]]. (4) 

The idea here is that the universal quantifier everonei can be put first in 
the LF representation. The universal quantifier could also be put last, but 
by convention it is taken to come first. To achieve this there is movement 
from everyonei being last to being first in the S-structure this leaves a trace 
Xi in place of everyonei 



1.5 Wh- Raising. 

Consider the sample surface structure question (compare Haegeman (1994) 
[l| p.494 eq.9): 

Who did Jones see? (5) 

In this case the word order of the surface structure and the LF remain the 
same. In term of symbols 

Wh(x) , xeH ,JeSx (6) 

where Wh(x) denotes "Who Using traces this can be expressed in 

terms of words 

For which x, x is human, is it the case that Jones saw x? (7) 
This has S-structure representation 

[cpWhoi did[ip Jones see ij]]? (8) 

and LF representation 

[cpWhoi did[ip Jones see Xi]]l (9) 

The S-structure and the LF have the same form, but with different traces. 
The S-structure ordering of Wh-phrases is not the same in all languages; in 
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some Wh-words do not appear on the left, so that in these cases both the 
S-structure and the LF would not have the same form: having LF depending 
on specific languages is contrary to its name. There can be ambiguity in 
the scope (or domain of applicability) of the quantifiers; this is illustrated 
by the sentence (compare Haegeman (1994) [19] p. 490 eqs.4 and 5) 



Everyone saw someone. (10) 
which has the two interpretations: 

For every x there is some y such that it is the case that x saw y. (11) 
and 

There is some y, such that for every x, it is the case that x saw y. (12) 

2 F-level Generalization of Logical Form. 

2.1 Drawbacks with the Standard Approach 
to Logical Form. 

The notion of logical form is described in May (1985) |3^] and Hornstein 
(1995) [ 21 1 , Stanley (1998) |47| discusses the origin of logical form. Logical 



form as currently understood has at least two drawbacks. The first is that 
it is restricted - in the sense that it is based on the simple calculi of logic 
such as the predicate and propositional calculi. For example sentences such 
as 

The probability of snow is 80%. (13) 

require an understanding of probability and hence of real numbers, 7Z, which 
have continuous properties as opposed to the discrete properties of sim- 
ple calculi and standard LF (logical form). This implies that sentence ^ 
requires a larger formal structure, encompassing real numbers 1Z, than is 
usually included in LF. Sentence [L3| can be represented 

Prob, pe[0,l], (snow = 0.8) (14) 

with similar representations for other formal mathematical statements such 
as occur in fuzzy logic. The second drawback is that a string such as 

(xeH -» Je5x)Vx, (15) 
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unambiguously means the same as equation [2 ; but the symbol order is dif- 
ferent, \/x (for all x) coming first in equation 2| and last in equation 15. In 
the principles and parameters approach the order of the symbols is essential, 
otherwise it is hard to know where to insert traces, compare §1.4&1.5. 



2.2 A New Approach to Logical Form. 

To overcome these drawbacks consider Frege's approach to the meaning of 
a sentence. To quote Dummet (1973) [jlO| p. 83 

Frege drew, within the intuitive notion of meaning, a distinction 
between three ingredients: sense, tone and force. 

Here the variation of these that is used can be represented by diagram three: 
Reference and force are essentially unchanged, but sense is decomposed into 
three. Lexical referents are words and the concomitant symbol used in the 
formal language. Formal declarants are similar to the beginning of computer 
programmes where the following three things are specified the language used, 
the parameters used, and the scope (local or global, or other specified order- 
ing to give precedence in case of ambiguity) are specified. The formal string 
is an ordered set of symbols which are well formed (well defined) in the lan- 
guage. For example the sense for equation [l] would be decomposed as follows 



Lexical Referents 
Formal Declarants 
F or mal String 



g = Jones, S = saw, h = human, 
Predicate Calculus, xeH, 
Vx(xeH -» JeSx). 



(16) 



Another advantage of the new approach is that sentence [R] is ambiguous 
leading to either |ll] or 12. This ambiguity can be removed by limiting the 
scope of the variables in the formal declarants. 



3 Maximal Verses Minimal Encoding of Informa- 
tion. 

3.1 Extremal Principles. 

In physics there are least action principles which, when the action is mini- 
mal, give differential equations which describe the dynamics of systems. The 
analogy has been carried through to other areas of science, see for example 
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Diagram Three 



Frege 



Sense 



Tone 



Force 



Here: 




Reference 



External Referents 
(context) 



Sense 



Lexical Referents 
Formal Declarants 
Formal String 



Force 



Force 



Figure 3: Diagram Three - modification of Frege's sentence content from 
sense, tone & reference to five factors. 
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Roberts (1998) |44]]§3&§1^[3 and references therein. There is a minimalist 
program in theoretical linguistics, Lasnik (1998) [B^j and Culicover (1998) 
which invokes an economy principle where the steps, symbols and rep- 
resentations in the principle and parameters approach are minimal. Gibson 
(1998) (l(| proposes a theory which invokes economy of processing; this the- 
ory relates sentence processing to available computational resources. The 
computational resources have two components, firstly an INTEGRATION COST 
component and secondly a memory COST component which are quanti- 
fied in the number of syntactic categories that are necessary to complete the 
current input string as a grammatical sentence. These cost components are 
influenced by locality which entails both 1) the longer a predicted category 
must be kept in memory before the prediction is satisfied, the greater the 
cost for maintaining that prediction and 2) the greater the distance between 
an incoming word and the most local head or dependent to which it attaches, 
the greater the integration cost. Gibson claims his theory explains a wide 
range of processing complex phenomena not previously accounted for by a 
single theory. Lee and Wilks (1999) |2^] suggest that it is implausible that 
there is a highly nested belief structure computing the nature of speech acts, 
rather there is a minimal set of beliefs. The garden path model §5.4 uses a 
minimum principle. 

3.2 Minimal Principles and Word Order. 

If a principle and parameters approach is going to be used in order to legislate 
movement between the F-level and the sentence level, then some order must 
be given to both the elements of the formal declarants and the formal string. 
It is difficult to justify such an order a priori: an optimistic hope is that 
it could be explained by a minimum encoding of information and thus to 
economy of processing. Sentences, both formal and informal, can contain 
redundant information. For example in the predicate calculus strings which 
are always true can be added to a given string without effecting the resulting 
truth value. It is hard to see how this could be compatible with a minimum 
encoding of information. Also sentences can be expressed in several ways. 
For example in the predicate calculus using the four connectives {and, or, 
not, implies}, or by using the one connective {Sheffer stroke} or {Pierce 
symbol}, see for example Prior (1962) p0| p. 31. For the purposes of the 
P-model in §|| it is assumed that an ordering of the familiar (or standard) 
type as used in §[j] can be used. For other approaches to word order see 
Downing and Noonan (1995) @, and Bozsahin (1998) f§. 
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3.3 The Maximal Encoding of Information. 

It is suggested that a maximal amount of information about a language's 
grammar and lexicon (vocabulary) are stored. This is also an extremal prin- 
ciple however it is the precise opposite of more common minimal principles. 
From the point of view of language acquisition, see §1.2, what happens is 
that when a lexical item (a particular sample word) is first heard under- 
standing of it is limited, so that it is only partially learnt, resulting in it 
being only used in limited contexts. As exposure to the lexical item in- 
creases more about its semantic and grammatical properties are learnt so 
that it can be used in wider contexts. This fits in with aptitude tests for 
word meaning, where understanding of word nuance is more important than 
understanding esoteric words. What is happening here is that a maximum 
encoding of information about the word is taking place. Generalizing, this 
mechanism happens not only to lexical items, but to many other aspects of 
language, such as understanding of intricate grammatical structures, this is 
how linguistic performance is learnt, compare Garman (1990) §3.1.2. At 
first sight requiring maximal information about a lexical item might seem 
inefficient and contrary to economy principles, but could occur because it 
allows for a quick analysis of corrupted data, such as speech, which is often 
only partial heard. Also maximal storage would aid the very fast processing 
of language. What is a minimum and what is a maximum has to be kept 
track of: although the information stored about a lexical item might be 
maximal, the method of obtaining this information could be minimal. 

4 Psycholinguistics Models of Word Production. 

4.1 Serial verses Connectionist models of word production. 

Psychological models of a segment (or part) of a language come in two basic 
types: serial and connectionist. Serial models work like a serial computer 
programme with each operation being performed sequentially. Connection- 
ist models have objects which interact to alter one another's connection 
weights. These ideas can be applied to whole sentences or individual words. 
For the T-model or something similar to work there must be serial process- 
ing at a late stage in sentence production, because of the discrete nature 
of its lexical items and representations, see diagram one. This does not 
preclude connectionist processing before the D-structure representation, or 
even before the understanding of an individual word. There are PROLOG 
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Figure 4: Diagram Four - The Stemberger Diagram. 



models of restricted sentence production, Johnson and Klein (1986) (24|, pSj ], 

4.2 The Stemberger Interactive model of individual word 
production. 

There are psycholinguistic models of individual word production. There is 
evidence that the reception of speech is interactive, for example there is evi- 
dence that seeing the speaker speak influences the word heard, McGurk and 
McDonald (1976) [Q; also Tanenhaus et al (1995) [M examine how visual 
context influences spoken word recognition and mediated syntactic process- 
ing, even during the earliest moments of language processing. Furthermore 
there is evidence that verbal speech production interacts with gesture, Mc- 
Neil (1985) p^j , and various other physiological activities, Jacobson (1932) 
[p3| p. 692, and these observations suggests that the verbal part of speech pro- 
duction is interactive. A model which allows both phonological and semantic 
influences to interact is the psycholinguistic interaction speech production 
model, c.f. Stemberger (1985) [flq] . In this model, when the word "feather" 
is activated a lot of other words are also activated with varying weights ac- 
cording to how closely they resemble "feather". This can be pictured by 
diagram four, to quote Stemberger's p. 148 text: 
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Semantic and phonological effects on lexical access. ... an arrow 
denotes an activating link, while a filled circle is an inhibitory 
link. A double line represents a large amount of activation, a 
single solid line somewhat less, and a broken line even less. Some 
of the inhibitory links have been left out ... for clarity. The 
exact nature of semantic representation is irrelevant here, beyond 
the assumption that it is composed of features; ... a word in 
quotation marks represents its meaning. 

There is suppression (also called inhibition) across a level, and activation 
up or down to the next level. This model accounts for syntax by giving 
different weights to the different words so that words on the left come first. 
Speech errors come from the noise in the system. There are three kinds of 
noise. The first is that the resting level of a unit node is subject to random 
fluctuations; with the result that it is not the case that the unit nodes degree 
of activation remains at the base line level. A fluctuation could produce a 
random production of a part of a word. The second is that words that are 
used with a high frequency have a higher resting level, and therefore reach 
activation threshold, or "pop out", quicker. This implies that there should 
be less error for these high frequency words; furthermore it implies that when 
real words occur as an error, higher frequency words should occur as errors 
more often, and this does not happen. The third is the so-called systematic 
spread of activation; this means that the weights in the interaction allow an 
inappropriate activation of word. There are connectionist programmes of 
word recognition, e.g. Seidelberg and McClelland (1989) |M - 

4.3 Atomist Semantic Feature Models. 

There are atomist semantic feature models (Garman (1990) p. 388) to 
which serial models of individual word meaning could be built, however the 
connectionist interaction picture seems to have less drawbacks. From now 
on it is assumed that words in some concrete form can be assumed and the 
question becomes how are they related to make longer structure such as 
sentences. 

4.4 The Marslen- Wilson model of word recognition. 

Marslen- Wilson (1987) [^TJ claims that the process of spoken word recogni- 
tion breaks down into three basic functions: access, selection and integration. 
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Access is concerned with the mapping of the speech input onto the represen- 
tations of lexical form. At the earliest stages of the lexical access process, 
all lexical memory elements whose corresponding words begin with a par- 
ticular acoustic sequence will be fully activated, these activated words are 
called a COHORT, see also Garman (1990) |13| p. 286. Selection is concerned 
with the discrimination of the best fitting match to this input. Integration 
is concerned with the mapping of syntactic and semantic information at the 
lexical level onto higher levels of processing. Context comes into play after 
there is information about the syntactic and semantic properties of mem- 
bers of the cohort. Such models embody the concepts of multiple access and 
multiple assessment, allowing for a maximally efficient recognition process, 
based on the principle of the contingency of perceptual choice. 

4.5 The classification approach to word prediction. 

Zohar and Roth (2000) |53| say that the eventual goal of a language model 
is to accurately predict the value of a missing word given its context. They 
present an approach to word prediction that is based on learning a represen- 
tation for each word as a function of words and linguistics predicates in its 
context. They address a few questions that this approach raises. Firstly in 
order to learn good word representations it is necessary to use an expressive 
representation of the context. They present a way that uses external knowl- 
edge to generate expressive context representations, along with a learning 
method capable of handling the large number of features generated this 
way that can, potentially, contribute to each prediction. Secondly since the 
number of words "competing" for each prediction is large, there is a need to 
"focus the attention" on a smaller subset of these. They exhibit the contri- 
bution of a "focus of attention" mechanism to the performance of the word 
predictor. Finally they describe a large scale experimental study in which 
the approach presented is shown to yield significant improvements in word 
prediction tasks. 

5 Psycholinguistic Models of Sentence Production. 

5.1 Sentence production in larger structure. 

Research in psycholinguistics can be split up according to the size of the 
structure under study, for example text processing, sentence processing and 
word meaning. Here structure larger than sentences, for example discourse 
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(see Graesser et al [|17)), are not looked at. Sentence processing is con- 
cerned with how the syntactic structures of sentences are computed, and 
text processing is concerned with how the meanings of larger units of text 
are understood. Research in both domains has begun to use the information 
that can be obtained from a large corpora of naturally occurring texts. In 
text processing, recent research has focused on what information the words 
and ideas of a text evoke from long term memory quickly, passively, and 
at low processing cost; text processing is not looked at here. According to 
McKoon and Ratcliff (1998) |3^| in recent sentencing research, a new and 
controversial theme is that syntactic computations might rely heavily on 
statistical information about the relative frequencies with which different 
syntactic structures occur in language. Gerdemann and van Noord (1999) 



[15[ discuss various rewrite rules used in several areas of natural language 
processing, such rewrite rules might change word order. Hall (1995) |pCf| 
discusses the representations of various linguistic competences. Sentences 
can mean different things in different contexts, see Akman and Surav (1998) 
@] and MacDonald et al (1994) ||]. Ferro et al (1999) ff| produce learning 
transformation rules that find grammatical relations and find that gram- 
matical relations between core syntax groups bypasses much of the parsing 
phrase. Tabor et al (1997) |^] describe a model which works by analogy 
with dynamical systems. Attractors are taken simultaneously to have prop- 
erties of syntactic categories, with some encoding of context dependent lex- 
ical information. Various experiments were contrived which examined the 
interactions of simple lexical frequencies, and their results favoured their 
dynamical model over traditional approaches. Truswell et al (1994) (5l]] de- 
vise two eye-movement experiments which show that animate nouns where 
harder to disambiguate when parsing. 

5.2 The Clarke & Clarke serial model 
of sentence production. 

There are numerous serial models of sentence production and textbooks on 



the subject for example Rosenberg (1977) [45]. Here three, the Clarke and 
Clarke serial model, the Garret serial model, and the garden path model are 
very briefly presented before going on to the P-model. Clarke and Clarke || 
p. 278 mention the formulation of an articulatory [sic] program [sic] which 
has five steps: 

(1) Meaning Selection: The first step is to decide on the meaning the present 
constituent is to have. 
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(2) Selection of a Syntactic Outline: The next step is to build a syntactic 
outline of the constituents. It specifies a succession of word slots and indi- 
cates which slots are to get primary, secondary, and zero stress. 

(3) Content word selection: The third step is to select nouns, verbs, adjec- 
tives, and adverbs to fit the appropriate slots. 

(4) Affix and function word formation: With the content words decided on, 
the next step is to spell out the phonological shape of the function words 
(like articles, conjunctions, and prepositions), prefixes, and suffixes. 

(5) Specification of phonetic segments: The final step is to build up fully 
specified phonetic segments syllable by syllable. 

By step (5), the articulatory program [sic] is complete and can be executed. 
Typically, however, people monitor what they actually say to make certain 
it agrees with what they intended it to mean. Whenever they detect an 
error, they stop, correct themselves, and then go on. It seems likely that the 
more attention is required elsewhere - in planning of various sorts - the less 
likely they are to detect an error. Indeed, many tongue-slips go unnoticed 
by both speakers and listeners. 

5.3 The Garret serial model of sentence production. 

The Garret (1980) @ (see also Garman (1990) |J| p.394) has three basic 
levels: the message level, the sentence level, and the articulatory level. At 
the message level a mental model or image of what is about to be expressed 
is formulated. Loose ideas of the form of the individual words and the overall 
structure in which they are expressed are formulated to give the sentence 
level. Here the actual words to be used and the structure in which they 
occur are crystallized to give the positional level representation which is 
articulated. 

5.4 The Garden Path Model. 

More recent garden path models are reviewed in Prazier (1987) fllfl , who 
says on pages 561-562 (slightly adjusted): 

"In the garden path model, perceivers incorporate each word of an input 
into a constitute structure representation of the sentence, roughly as each 
item is encountered. At each step in this process, the perceiver postulates 
the minimal number of nodes required by the grammar of the language un- 
der analysis, given the structure assigned to preceding items. This leads to 
the two principles of the garden path model: 
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(1) Minimal Attachment: Do not postulate unnecessary nodes. 

(2) Late Closure: If grammatically permissible, attach new items into the 
clause or phrase currently being processed (i.e. the phrase or clause postu- 
lated most recently)." 

In other words minimal attachment entails that a perceiver, given so much 
of the beginning of a sentence chooses the minimal completion of it which 
makes sense both grammatically and semantically. The minimal attachment 
principle is related to the requirement that the ultrametric height of a sen- 
tence should be a minimum, see Roberts (1998) §3, which was written 
before I had heard of the garden path model. If sentences have X structure 
and hence binary branching at each node, see for example Roberts (1998) 



[44 1 §2, then the two notions are the same. Minimal attachment is also in 
accord with the minimal principles discussed in §3.1. McRae et al (1998) 
[j38f use time measuring experiments to see how event specific knowledge 
resolves structural ambiguity. Their results suggest that the structure of 
sentences is best described by a semantic constraint model, then a garden 
path model with a very short delay, and finally by a one region delay garden 
path model. Their models and experiments show that event specific knowl- 
edge is used immediately in sentence comprehension, and this agrees with 
maximal encoding of information in §3.3 above. 



6 The P-model. 

6.1 Description of the P-model. 

The P-model can be pictured by the diagram five. Semantic intent produces 
a F-level representation of the sentence and its context and force. Thought 
(semantic intent), occurs prior to words (lexical referents). The external ref- 
erents and the lexical referents combine to produce binding referents which 
constrain movement between the D-structure and S-structure representa- 
tion. The D-structure is constructed from the lexical referents (words) and 
the formal declarants and string. The reason that the D-structure sentence 
changes to the S-structure sentence is because of the emphasis (or Fregean 
Force) the speaker wishes to convey in the sentence. It is known, see the 
beginning of |4.2j , that in some cases gesture and other behaviour co-occur 
with spoken sentences: hence the Fregean force and the S-structured sen- 
tence interact to produce the gesture and emphasis concomitant with the 
audible production of the sentence. It is possible to produce variants of the 
above analysis. 
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Diagram Five 




A(udible)-Level Context Audible Sentence Gesture and Emphasis. 

Figure 5: Diagram Five - The P-model. 
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Diagram Six 



LF 



movement 



V 

D-structure 



Figure 6: Diagram Six - Quantifier Lowering 



6.2 The P-model from a principle & parameters viewpoint. 

From a principles and parameters view point: for the T-model movement 
between S-structure and LF has to be explained, however for the P-model 
movement between the F-level and D-structure has to be explained. Disre- 
garding the caveats of §2 concerning LF, the P-model requires an explana- 
tion of movement between LF and D-structure, as depicted by diagram six. 
This can be illustrated by choosing similar examples to §1.4,1.5, but now 
suggesting quantifier lowering and Wh-lowering. 

6.3 Quantifier Lowering. 

This can be represented by diagram six or in terms of symbols 



where H, J and S denote "human", "Jones", and "saw" as in §1.4. This can 
be expressed in words 



Vx{Hsx -► JeSx) 



(17) 



Ui Jones saw everyonei 



(18) 



or in terms of the quantifier everyonei 



Everyone Jones saw. 



(19) 



where yi the trace left by the movement of the word everyonei 
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6.4 Wh-lowering. 

This can again be represented by diagram six, and corresponds to §1.5. In 
term of symbols 

Wh (x),xeH, JeSx. (20) 
which can be expressed in words 

y l did Jones see who 1 ? (21) 

or filling in the trace 

Who did Jones see? (22) 

7 Conclusion. 

The T-model is explicitly a serial sentence production model. It is the ba- 
sic framework in which modern linguistics is currently understood, linguists 
being mainly occupied with describing movement between D-structure and 
S-structure. In psychology there are also serial sentence production models 
which bear little resemblance to the T-model. Rather than try to directly 
accommodate the T-model into a psycholinguistic serial model here an en- 
tirely new model has been constructed based upon altering the T-model 
from what the author perceives to be its defects. One of the defects is that 
the T-model requires that a S-structure sentence can, after it has been ut- 
tered, be analyzed into a formal component called LF. This analysis requires 
movement analogous to that between D-structure and S-structure. There is 
no apparent sentence production purpose for doing such a post-hoc analysis, 
the reason seems to be that historically given sentences were analyzed for 
their logical content. Here it was argued that the correct place for LF is 
before D-structure. This fits in naturally into a processing model of sen- 
tence production, where the LF can be thought of as part of the message 
level. Despite expressing reservations about string order in LF (at least as 
it is commonly understood), it was shown in some simple cases how move- 
ment can be described between LF and D-structure. In this case description 
of movement has a purpose as it elucidates part of a sentence production 
model. From the point of view of a linguist the most important require- 
ment of any model is whether there is movement between a D-structure and 
S-structure. This is retained in the P-model. 
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